Spaces of RNA Secondary Structures
نویسنده
چکیده
We prove two topological theorems in physical chemistry. Namely, we introduce a hybrid of transverse and tangential measures on train tracks to prove sphericity of various simplicial complexes which arise from certain idealized models of physical chemistry. These complexes are at once identified with Thurston’s space of projective geodesic laminations on an ideal polygon and with the analogue of a compactification (described elsewhere) of the moduli space of a punctured Riemann surface. The physical structures we study are various sub-collections of the set of all possible planar chemical bonds among the sites of a linear macromolecule. Each such collection we consider has a natural partial ordering, and the geometric realizations of appropriate posets are shown to be topological spheres. Such a topological statement encodes a wealth of combinatorial data, as we briefly discuss. In fact, our primary motivation here is to study secondary structures on RNA. This imposes the further restriction that there can be at most one base-pair supported at a given site of underlying linear macromolecule, and imposing this restriction leads to the class of “binary macromolecules.” Our main results here assert the sphericity of certain topological spaces of both arbitrary and binary macromolecules, and it is the latter which we hope may have applications to RNA. Our techniques are largely elaborations of elementary topological techniques from Techmiiller theory and the theory of train tracks.
منابع مشابه
Relation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
متن کاملPreRkTAG: Prediction of RNA Knotted Structures Using Tree Adjoining Grammars
Background: RNA molecules play many important regulatory, catalytic and structural <span style="font-variant: normal; font-style: norma...
متن کاملExploring the Space of Consensus RNA Secondary Structure Motifs Using Suffix Arrays
In the last few years, we have seen a rapid increase of the number of known RNA families. For a significant fraction of them, the mechanisms of action remain unclear. Their signature combines structure and sequence information. In most cases, they are difficult to identify from sequence alone. Traditional approaches to identify RNA motifs seek to find a conserved structure with minimum free ene...
متن کاملPrediction of RNA secondary structures: from theory to models and real molecules
RNA secondary structures are derived from RNA sequences, which are strings built form the natural four letter nucleotide alphabet, {AUGC}. These coarse-grained structures, in turn, are tantamount to constrained strings over a three letter alphabet. Hence, the secondary structures are discrete objects and the number of sequences always exceeds the number of structures. The sequences built from t...
متن کاملA zero one programming model for RNA structures with arclength ≥ 4
In this paper, we consider RNA structures with arc-length 4 . First, we represent these structures as matrix models and zero-one linearprogramming problems. Then, we obtain an optimal solution for this problemusing an implicit enumeration method. The optimal solution corresponds toan RNA structure with the maximum number of hydrogen bonds.
متن کامل